CDS

Accession Number TCMCG058C01451
gbkey CDS
Protein Id KAF7113725.1
Location complement(join(5276..5609,6209..6306,6443..6613,7476..7591,8136..8533,8851..8996,9732..9890,10914..11078))
Organism Rhododendron simsii
locus_tag RHSIM_RhsimUnG0109100

Protein

Length 528aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA588298, BioSample:SAMN13241185
db_source WJXA01000267.1
Definition hypothetical protein RHSIM_RhsimUnG0109100 [Rhododendron simsii]
Locus_tag RHSIM_RhsimUnG0109100

EGGNOG-MAPPER Annotation

COG_category K
Description Transcription initiation factor IIF subunit
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko03021        [VIEW IN KEGG]
KEGG_ko ko:K03138        [VIEW IN KEGG]
EC -
KEGG_Pathway ko03022        [VIEW IN KEGG]
map03022        [VIEW IN KEGG]
GOs GO:0003674        [VIEW IN EMBL-EBI]
GO:0005488        [VIEW IN EMBL-EBI]
GO:0005515        [VIEW IN EMBL-EBI]
GO:0008022        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGTCGTTCGATTTGAAGCCGTGTTGTGGCGGCTGTGGATCGTCTAAGGATCTTTATGGAAGCAATTGCAAGCACTTGACTATGTGCTTCGCTTGTGGCAAAACCATGGCCGAGAACGGTGGTAAATGCTGCGAGTGCGGCGCCACCATCACTCGCTTGATTCGGGAATATAAAATCCGGGCATGTTCCAGCAGCGACAAGAACTTCTTCATTGGTAGGTTTGCTACGGGTTTACCAATTATTTCAAGCAAGCGGAATGCCCAGAACAAATGGTCTCTCGTGAAAGAAGGATTACTAGGTCGCCAACTAACTGAAGCTTTGCGGGAGAAATACAAGAATAAACCTTGGCTATTGGTGAACGAAATGGGCCAATCTCAGTACCATGGTTACCTTGAGGGTGCACCATCAGCATCTTACTACCTACTAATGATGCAGGGGAAGGAGTTTGTTGCTATTCCTGCAGGTTCTTGGTACAACTTTAACAAAGTTGCACAATATAAGCAACTTACCTTGGAGGAAGCTGAAGAGAAAATGAAAAATAGAAGAAAAACTGCAGATGGGTATCAAAGATGGATGATGAAAGCTGTGAACAATGGACCTGCGGCATTTGGCGAAGTAGAAAAGATTGATGATAAGGAAGGTGGGGCTAGTTGTGGAAGAGGACGTAAAAAACCCAGAGGGGATGATGGTGAAGCTAATGCTTCAGATAGTGGAGAAGATGATGAAGAGGAACAGGAAGAGGCAGCGAGAAACATTATAGGTGGTGGGCTCTTCAGAAGAGGCAATAACGACGAAGAAGAAGGTGCAAGAGGAGGTGACCTTGATCTGGATGATGATGACGATTTTGAGAAGGGCGAGCATTTGCCTTGTGATGATTGGGAGCATGAAGAGACCTTTACTGATGATGATGAAGCTGTGGGTAATGATCCCGAGGAACGAGAAGATTTAGGCCCAGAGATTTCTGCCCCTCCAGAAATTAAACAGGATGAAGATGAAACAAATGAAGAGGAGGGAGGATTGAGCCAATCTGGAAAAGAGTTGAAGAAGCTGCTCGAGCGAGCTTGTGGTCCGAATGTTATGGTTGTAGAGGGTGACGATGGCAATGATGATGCAAGTAACTATGGGAAGGCGGGCTCCTTACCATTTTTATGTGTTATAAATGATGATATTTCACCGGTGTTGGCTCCTAATAAGAAAGATGCTCCGAAAGAAGGACCTGCCAGTAACGTCAAGGGAACCCCAACTATCTCTAATATGAGTTTCCTTGATATTCCAAAGCTTACTCTAGACGTACCTTATTTTCATTTTTCTTTCGTGTTGGAGCAGGAGGTGAAATCTTCGAAAGATAAACCTTCGTCATCTTCGAAACATGCATCGACTGATTCATCTGCTGGACCTGTCACAGAAGATGAAATCCGGGCTGTTCTATTGCAAAAGTCTCCTGTGACTGCTAAGGATCTTGTCAATGTGTACTTTAAAGCAAGACTACGATGTGACAAGGTATGCCTTTCTTGTTTTGTCAGGTTCTATGCAATAAACTCCAGAAAACTTGAACTGTCTTTGTTTGAAATATCAAAAAAGACCTGA
Protein:  
MSFDLKPCCGGCGSSKDLYGSNCKHLTMCFACGKTMAENGGKCCECGATITRLIREYKIRACSSSDKNFFIGRFATGLPIISSKRNAQNKWSLVKEGLLGRQLTEALREKYKNKPWLLVNEMGQSQYHGYLEGAPSASYYLLMMQGKEFVAIPAGSWYNFNKVAQYKQLTLEEAEEKMKNRRKTADGYQRWMMKAVNNGPAAFGEVEKIDDKEGGASCGRGRKKPRGDDGEANASDSGEDDEEEQEEAARNIIGGGLFRRGNNDEEEGARGGDLDLDDDDDFEKGEHLPCDDWEHEETFTDDDEAVGNDPEEREDLGPEISAPPEIKQDEDETNEEEGGLSQSGKELKKLLERACGPNVMVVEGDDGNDDASNYGKAGSLPFLCVINDDISPVLAPNKKDAPKEGPASNVKGTPTISNMSFLDIPKLTLDVPYFHFSFVLEQEVKSSKDKPSSSSKHASTDSSAGPVTEDEIRAVLLQKSPVTAKDLVNVYFKARLRCDKVCLSCFVRFYAINSRKLELSLFEISKKT